On building a concatenative speech synthesis system from the blizzard challenge speech databases
نویسندگان
چکیده
In this paper, we compare two methods of building a concatenative speech synthesis system from the relatively small, “Blizzard Challenge” speech databases. In the first method we build a system directly from the Blizzard databases using the IBM Concatenetative Speech Synthesis System originally designed for very large speech databases. In the second method, a larger database is used to build the synthesis system and the output is “morphed” to match the speakers in the Blizzard databases. The second method outperformed the first while maintaining the identity of the Blizzard target speakers.
منابع مشابه
The IBM Submission to the 2006 Blizzard Text-to-Speech Challenge
In this paper, we present two concatenative text-to-speech systems built from the “Blizzard Challenge” speech databases. The two systems differ primarily in their segment selection cost function. One system has our baseline cost function, and the other has a cost function which has been altered to potentially better handle small datasets. Results indicate that both systems perform similarly in ...
متن کاملThe VoiceText Text-to-Speech System for the Blizzard Challenge
This paper introduces the VoiceText text-to-speech system developed by Voiceware. By means of corpus based concatenative speech synthesis technique, we built high quality synthetic voices using the dataset provided for the Blizzard challenge 2007. The evaluation results show that VoiceText achieved high performances in both naturalness and intelligibility of synthesized speech.
متن کاملMILE TTS for Tamil for blizzard challenge
Our participation in the Blizzard Challenge 2014 is only for the Tamil language. We have a unit selection based concatenative speech synthesis system. Sentence level viterbi search is used to select the reliable speech units among a set of candidate units. The given RD (reading), SUS (semantically unpredictable sentences) and ML (multi‐lingual) test sentences are synthe...
متن کاملThe GlottHMM Entry for Blizzard Challenge 2012: Hybrid Approach
This paper describes the GlottHMM speech synthesis system for Blizzard Challenge 2012. The aim of the GlottHMM system is to combine high-quality vocoding and detailed prosody modeling in order to produce expressive, high quality, synthetic speech. GlottHMM is based on statistical parametric speech synthesis, but it uses a glottal flow pulse library for generating the excitation signal. Thus, it...
متن کاملمراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی
Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...
متن کامل